Connectionist Learning of Natural Language Lexical Phonotactics

نویسندگان

  • Ivelin Stoianov
  • John Nerbonne
چکیده

Connectionist learning of natural language words and their phonetic regularities is presented. The Neural Network (NN) model we employ in this problem is the Simple Recurrent Network, trained with the Backpropagation Through Time (BPTT) learning algorithm. During the training, it was assigned the task of predicting the next phoneme given one phoneme at each moment and keeping information of the past phonemes from a given word in a few context neurons. The phonotactics of the Dutch language was studied among others. The shortcomings of some similar previous implementations are explained and successfully overcome. Among the techniques we employed to achieve the much-improved error rate of 1.1% with monosyllabic words and 3.5% with multisyllabic ones are new methods for network response interpretation, an evolutionary approach in training a set of networks, and the exploitation of the word frequencies in training. Finally, an analysis of the phonotactics rules extracted by a trained network is presented.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The eŒect of probabilistic phonotactics on lexical acquisition

The eŒect of probabilistic phonotactics on lexical acquisition in typically developing children was examined to determine whether a lexical or sublexical level of language processing dominates lexical acquisition. Sixty-one normally achieving 7, 10, and 13 year-old children participated in a word learning task, involving non-words of varying probabilistic phonotactics. Non-words were presented ...

متن کامل

Tree-based Analysis of Simple Recurrent Network Learning

In searching for a connectionist paradigm capable of natural language processing, many researchers have explored the Simple Recurrent Network (SRN) such as Elman(1990), Cleermance(1993), Reilly(1995) and Lawrence(1996). SRNs have a context layer that keeps track of the past hidden neuron activations and enables them to deal with sequential data. The events in Natural Language span time so SRNs ...

متن کامل

Modelling the phonotactic structure of natural language words with Simple Recurrent Networks

Simple Recurrent Networks (SRN) are Neural Network (connectionist) models able to process natural language. Phonotactics concerns the order of symbols in words. We continued an earlier unsuccessful trial to model the phonotactics of Dutch word corpus with SRNs. In order to overcome the previously reported obstacles, a new method for network testing was developed optimal threshold evaluation. Th...

متن کامل

Sublexical Influences on Lexical Development in Children

Previous experimental research has found that adults and infants are sensitive to the likelihood of occurrence of sequences of segments, or probabilistic phonotactics, in the ambient language. One hypothesis that emerges from this finding is that probabilistic phonotactics, a sublexical factor, may influence rate of lexical acquisition. Preliminary results are reported from a study involving 21...

متن کامل

Comparative Study of Degree of Bilingualism in Lexical Retrieval and Language Learning Strategies

This study compares lexical retrieval amongst monolinguals and intermediate bilinguals and advanced bilinguals. It also investigates the possible effects of their language learning strategies on their respective lexical retrieval advantage. The study used a mixed methods design and the groups consisted of 20 Persian near-monolinguals, 20 Persian-English intermediate level bilinguals, and 20 Per...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998